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Serial Storage of Ink and Its Properties 
Cross-Reference to Related Applications 

[01] This application is related to U.S. Application No. 60/212,825, entitled "Methods for 
Classifying, Anchoring, and Transforming Ink Annotations'*, filed June 21, 2000, and to 
U.S. Application No. 09/750,288, entitled "Classifying, Anchoring, and Transforming 
Ink", filed December 29, 2000, whose contents are expressly incorporated herein by 
reference as to their entireties. 

Field of the Invention 

[02] Aspects of the present invention are directed generally to apparatus and methods for 
controlling a graphical user interface (GUI). More particularly, aspects of the present 
invention relate to capturing and/or storing electronic ink. 

Background of the Invention 

[03] Typical computer systems, especially computer systems using graphical user interface 
(GUI) systems such as Microsoft WINDOWS, are optimized for accepting user input 
from one or more discrete input devices such as a keyboard for entering text, and a 
pointing device such as a mouse with one or more buttons for driving the user interface. 
The ubiquitous keyboard and mouse interface provides for fast creation and modification 
of documents, spreadsheets, database fields, drawings, photos and the like. However, 
there is a significant gap in the flexibility provided by the keyboard and mouse interface 
as compared with the non-computer (i.e., standard) pen and paper. With the standard pen 
and paper, a user edits a document, writes notes in a margin, and draws pictures and other 
shapes and the like. In some instances, a user may prefer to use a pen to mark-up a 
document rather than review the document on-screen because of the ability to freely 
make notes outside of the confines of the keyboard and mouse interface. 

[04] Some computer systems permit a user to draw on a screen. For example, the Microsoft 
READER application permits one to add electronic ink (also referred to herein as "ink") 
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to a document. The system stores the ink and provides it to a user when requested. Other 
applications (for example, drawing applications as known in the art are associated with 
the Palm 3.x and 4.x and PocketPC operating systems) permit the capture and storage of 
drawings. These drawings include other properties associated with the ink strokes used to 
make up the drawings. For instance, line width and color may be stored with the ink. One 
goal of these systems is to replicate the look and feel of physical ink being applied to a 
piece of paper. However, physical ink on paper may have significant amounts of 
information not captured by the electronic collection of a coordinates and connecting line 
segments. Some of this information may include the thickness of the pen tip used (as seen 
through the width of the physical ink), the shape of the pen tip, the speed at which the ink 
was deposited, and the like. 

[05] Another problem has arisen in the storage of electronic ink. While some applications 
permit the capture and subsequent rendering of ink, the ink can only be associated with 
the application in which the ink was captured. Thus, a user loses the portability of the ink. 
If a second application was to attempt to use the ink stored in conjunction with a first 
application, the second application will not be able to interpret non-standard properties 
associated with the ink. 

[06] Further, while data structures are known, the size of the data structure used to store 
information may become excessively large and cumbersome. An example of a data 
structure that permits the random storage of information is the interchange file format 
(IFF). While the IFF is a simple structure to compose, any data structure in IFF would be 
excessively large, as each separate element has to be separately defined. Redundant 
definitions for similar elements were repeated. 

[07] Accordingly, an improved system is needed for storing ink and its associated properties. 
Summary of the Invention 

[08] The present invention provides a flexible and efficient system, method, and data structure 
for receiving, storing, and rendering ink, thereby solving one or more of the problems 
identified with conventional devices and systems. 
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[09] Aspects of the present invention are directed to an improved system, method and data 
structure for storing ink and its associated properties. The properties may be associated 
with tags. 

[10] In some aspects of the invention, a data structure is used having a first portion v^ith an 
identifier; a second portion with size of tag data; and a third portion having the tag data. 

[11] In other aspects of the invention, methods are implemented for creating a data structure 
for storing ink comprising the steps of receiving ink information; detemwning ink 
properties; determining which ink properties are global; and creating a data structure with 
one representation of global for a set of ink strokes. 

[12] In further aspects of the invention, methods for using a data structure for storing ink are 
implemented including identifying a tag; retrieving the size of the data associated with 
the tag; and, if an application can use tag, then reading the data associated with the tag, 
otherwise skipping the tag by skipping past the tag data based on the size of the retrieved 
size of the data associate with the tag. 

[13] In further aspects of the invention, a system creates a data structure including an input 
receiving ink strokes; a processor parsing the received ink strokes and determining 
properties associated with the ink strokes; and a storage storing the ink strokes in a 
tagged data format. 

[14] In still further aspects of the invention, a system uses a data structure having a storage 
storing ink strokes in a tagged data format; and a processor retrieving the ink strokes and 
processing at least some of the tags associated with the ink strokes. 

[15] These and other features and aspects of the invention will be apparent upon consideration 
of the following detailed description of the preferred embodiments. 

Brief Description of the Drawings 

[16] The foregoing summary of the invention, as well as the following detailed description of 
preferred embodiments, is better understood when read in conjunction with the 
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accompanying drawings, which are included by way of example, and not by way of 
limitation with regard to the claimed invention. 

[17] Figure 1 shows a schematic diagram of a general-purpose digital computing environment 
that can be used to implement various aspects of the invention. 

[18] Figure 2 shows a plan view of a tablet computer and stylus that can be used in accordance 
with various aspects of the present invention. 

[19] Figure 3 shows a flowchart showing a process for storing ink strokes in accordance with 
embodiments of the present invention. 

[20] Figure 4 shows a general data structure for storing ink and or properties associated with 
ink in accordance with embodiments of the present invention. 

[21] Figure 5 shows a data structure for storing ink strokes including properties in accordance 
with embodiments of the present invention. 

[22] Figures 6A-C show data structures having a GUID table in accordance with embodiments 
of the present invention. 

[23] Figure 7 shows a detailed data structure having nested tags related to ink objects in 
accordance with embodiments of the present invention. 

[24] Figure 8 shows another data structure for storing strokes in accordance with embodiments 
of the present invention. 

[25] Figure 9 shows a data structure for storing drawing attributes in accordance with 
embodiments of the present invention. 

[26] Figure 10 shows data structure for storing stroke descriptors in accordance with 
embodiments of the present invention. 

[27] Figure 11 shows a data structure for metrics for ink in accordance with embodiments of 
the present invention. 
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[28] Figure 12 shows a data structure for storing packet properties in accordance with 
embodiments of the present invention. 

[29] Figure 13 shows another data structure storing strokes in accordance with embodiments 
of the present invention. 

[30] Figure 14 shows a data structure storing a point property list in accordance with 
embodiments of the present invention. 

[31] Figure 15 shows a detailed example of an ink object having at least one stroke and 
properties in accordance with embodiments of the present invention. 

[32] Figure 16 shows an example of a stroke of ink having points and properties in accordance 
with the present invention. 

[33] Figure 17 shows a first method for reading a data structure with known and unknown 
properties in accordance with embodiments of the present invention. 

[34] Figure 18 shows a second method for reading a data structure with known and unknown 
properties in accordance with embodiments of the present invention. 

Detailed Description of Preferred Embodiments 

[35] The following description is divided into sub-sections to assist the reader. The sub- 
sections include: characteristics and storage of ink; terms; general purpose computer and 
associated hardware; an example of strokes of ink; conmion stream items; a complex 
example; encoding of values; and a summarization of the storage of ink. 

Characteristics and Storage of Ink 

[36] The present invention relates to the storage of electronic ink and/or properties or other 
data associated with electronic ink. Ink as used herein refers to electronic ink. Ink refers 
to a sequence of strokes, where each stroke is comprised of a sequence of points. The 
points may be represented using a variety of known techniques including Cartesian 
coordinates (X, Y), polar coordinates (r, 0), and other techniques as known in the art. 
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[37] As known to users who use ink pens, physical ink (the kind laid down on paper using a 
pen with an ink reservoir) may convey more information than a series of coordinates 
connected by line segments. For example, physical ink can reflect pen pressure (by the 
thickness of the ink), pen angle (by the shape of the line or curve segments and the 
behavior of the ink around discreet points), and the speed of the nib of the pen (by the 
straightness, line width, and line width changes over the course of a line or curve), 

[38] To provide the look and feel of physical ink, the exemplary disclosed system and method 
store ink strokes and properties associated with the ink strokes to more fully render ink. 
In some embodiments, ink may be stored as a series or set of strokes and a series or set of 
properties. In other embodiments, ink may be stored with a complex series of properties 
in which the properties have properties of their own. Properties of the ink may include, 
for example, color, width, pressure between the stylus and tablet, and angle between the 
stylus and tablet, and pen shape and the like. While these properties may suffice for many 
applications, embodiments of the present invention provide for the extensible storage of 
custom properties (and other data) generated by applications. All strokes and values may 
be stored directly with excess information. However, alternative embodiments reflect 
considerations that eliminate excess information when possible or practicable. 

[39] The properties used to define an ink object and the strokes within the ink object may have 
varying scope. For example, some properties may apply to all ink strokes in an ink object 
(e.g., the shape of a pen tip). Other properties may relate only to a specific point (e.g., a 
point at which a stylus starts a stroke). Others may relate to specific strokes while others 
may relate to packets of information as reported by hardware (e.g., coordinates, pressure, 
angle of pen, the intervals of time between reported coordinates, and the like). In short, 
properties have different levels of scope. 

[40] To efficiently store properties, some may be explicitly specified while others may be 
implicit. In a simple example, all properties may be default properties and not specified in 
an ink object. So, the ink object may only have X and Y coordinate values. In another 
example, the ink object may have properties that affect the entire ink object but the 
properties are specified in the ink object. In a third example, some strokes may have a 
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first set of properties and others have a second set of properties. The properties may be 
defined initially at the beginning of the ink object and the individual strokes may 
reference the previously defined properties as needed. Using this approach of defining 
properties then later referencing the properties promotes a greater efficiency in storing 
properties. This becomes more apparent as an ink object becomes larger as the number of 
properties increases and the number of ink strokes referencing the properties increases. 

Terms 

[41] Ink - A sequence or set of strokes with properties. A sequence of strokes may include 
strokes in an ordered form. The sequence may be ordered by the time captured or by 
where the strokes appear on a page. Other orders are possible. A set of strokes may 
includes sequences of strokes or unordered strokes or any combination thereof 

[42] Stream - A sequence of strokes that may or may not include properties that comprises a 
data structure. 

[43] Ink object - A data structure storing a stream with or without properties. 

[44] Stroke - A sequence or set of captured points. For example, when rendered, the sequence 
of points may be connected with lines. Alternatively, the stroke may be represented as a 
point and a vector in the direction of the next point. In short, a stroke is intended to 
encompass any representation of points or segments relating to ink, irrespective of the 
underlying representation of points and/or what connects the points. 

[45] Point - Information defining a location in space. For example, the points may be defined 
relative to a capturing space (for example, points on a digitizer), a virtual ink space (the 
coordinates in a space into which captured ink is placed), and/or display space (the points 
or pixels of a display device). 

[46] Virtual Ink Space - A framework to which all ink strokes relate. The framework may 
include a two or three-dimensional shape. In one example, the framework may include a 
unit size square. In another example, the framework may include a defined rectangle. 
While some ink strokes may extend outside of the framework, the framework may be 
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used for rendering purposes including dimensioning for a printer or a display. In one 
aspect, the framework is a norm to which ink strokes may be spatially defined. 

[47] Global Ink Properties - These are properties that apply to a stroke or set of strokes unless 
otherwise defined. For example, a selected ink color may be blue. By setting all strokes to 
blue, the default color of the strokes would be blue. 

[48] Local Ink Properties - These are properties that apply to a specific stroke (or data point or 
data points). For example, while a global ink property may be blue, a specific stroke may 
be set to red. Some local ink properties may be interpreted, in some cases, as global 
properties as they effect subsequently encountered strokes in an ink object. 

[49] Render - The process of determining how graphics (and/or ink) is to be displayed, 
whether on a screen or printed, or output into another file format. 

[50] GUID - A Global Unique Identifier is a term well known in the art. A GUH) is an integer 
(for example, 64 bit, 128 bit, 256 bit, or the like) that identifies an interface to an object 
across all computers and networks. A GUID may identify interfaces, replica sets, records, 
and other objects. Such an identifier is unique with a very low probability of being 
duplicated. Other definitions are known in the art and are incorporated herein by 
reference. 

General Purpose Computer 

[51] Figure 1 illustrates a schematic diagram of an exemplary conventional general-purpose 
digital computing environment that can be used to implement various aspects of the 
present invention. In Figure 1, a computer 100 includes a processing unit 110, a system 
memory 120, and a system bus 130 that couples various system components including the 
system memory to the processing unit 110. The system bus 130 may be any of several 
types of bus structures including a memory bus or memory controller, a peripheral bus, 
and a local bus using any of a variety of bus architectures. The system memory 120 
includes read only memory (ROM) 140 and random access memory (RAM) 150, 
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[52] A basic input/output system 160 (BIOS), containing the basic routines that help to 
transfer information between elements within the computer 100, such as during start-up, 
is stored in the ROM 140. The computer 100 also includes a hard disk drive 170 for 
reading from and writing to a hard disk (not shown), a magnetic disk drive 180 for 
reading from or writing to a removable magnetic disk 190, and an optical disk drive 191 
for reading from or writing to a removable optical disk 192 such as a CD ROM or other 
optical media. The hard disk drive 170, magnetic disk drive 180, and optical disk drive 
191 are connected to the system bus 130 by a hard disk drive interface 192, a magnetic 
disk drive interface 193, and an optical disk drive interface 194, respectively. The drives 
and their associated computer-readable media provide nonvolatile storage of computer 
readable instructions, data structures, program modules and other data for the personal 
computer 100. It will be appreciated by those skilled in the art that other types of 
computer readable media that can store data that is accessible by a computer, such as 
magnetic cassettes, flash memory cards, digital video disks, Bernoulli cartridges, random 
access memories (RAMs), read only memories (ROMs), and the like, may also be used in 
the example operating environment. 

[53] A number of program modules can be stored on the hard disk drive 170, magnetic disk 
190, optical disk 192, ROM 140 or RAM 150, including an operating system 195, one or 
more application programs 196, other program modules 197, and program data 198. A 
user can enter commands and information into the computer 100 through input devices 
such as a keyboard 101 and pointing device 102. Other input devices (not shown) may 
include a microphone, joystick, game pad, satellite dish, scanner or the like. These and 
other input devices are often connected to the processing unit 110 through a serial port 
interface 106 that is coupled to the system bus, but may be connected by other interfaces, 
such as a parallel port, game port or a universal serial bus (USB). Further still, these 
devices may be coupled directly to the system bus 130 via an appropriate interface (not 
shown). A monitor 107 or other type of display device is also connected to the system 
bus 130 via an interface, such as a video adapter 108. In addition to the monitor, personal 
computers typically include other peripheral output devices (not shown), such as speakers 
and printers. In a preferred embodiment, a pen digitizer 165 and accompanying pen or 
stylus 166 are provided in order to digitally capture freehand input. Although a direct 
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connection between the pen digitizer 165 and the processing unit 110 is shown, in 
practice, the pen digitizer 165 may be coupled to the processing unit 1 10 via a serial port, 
parallel port or other interface and the system bus 130 as known in the art. Furthermore, 
although the digitizer 165 is shown apart from the monitor 107, it is preferred that the 
usable input area of the digitizer 165 be co-extensive with the display area of the monitor 
107. Further still, the digitizer 165 may be integrated in the monitor 107, or may exist as 
a separate device overlaying or otherwise appended to the monitor 107, 

[54] The computer 100 can operate in a networked environment using logical connections to 
one or more remote computers, such as a remote computer 109. The remote computer 
109 can be a server, a router, a network PC, a peer device or other common network 
node, and typically includes many or all of the elements described above relative to the 
computer 100, although only a memory storage device 111 has been illustrated in Figure 
1. The logical connections depicted in Figure 1 include a local area network (LAN) 112 
and a wide area network (WAN) 113. Such networking environments are commonplace 
in offices, enterprise-wide computer networks, intranets and the Internet. 

[55] When used in a LAN networking environment, the computer 100 is connected to the 
local network 112 through a network interface or adapter 114. When used in a WAN 
networking environment, the personal computer 100 typically includes a modem 115 or 
other means for establishing a coramunications over the wide area network 113, such as 
the Internet, The modem 115, which may be internal or external, is connected to the 
system bus 130 via the serial port interface 106. In a networked environment, program 
modules depicted relative to the personal computer 100, or portions thereof, may be 
stored in the remote memory storage device. 

[56] It will be appreciated that the network connections shown are exemplary and other 
techniques for establishing a communications link between the computers can be used. 
The existence of any of various well-known protocols such as TCP/IP, Ethernet, FTP, 
HTTP and the like is presumed, and the system can be operated in a client-server 
configuration to permit a user to retrieve web pages from a web-based server. Any of 
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various conventional web browsers can be used to display and manipulate data on web 
pageSe 

[57] Figure 2 illustrates an exemplary tablet PC 201 that can be used in accordance with 
various aspects of the present invention. Any or all of the features, subsystems, and 
functions in the system of Figure 1 can be included in the computer of Figure 2. Tablet 
PC 201 includes a large display surface 202, e.g., a digitizing flat panel display, 
preferably, a liquid crystal display (LCD) screen, on which a plurality of windows 203 is 
displayed. Using stylus 204, a user can select, highlight, and/or write on the digitizing 
display surface 202. Examples of suitable digitizing display surfaces 202 include 
electromagnetic pen digitizers, such as Mutoh or Wacom pen digitizers. Other types of 
pen digitizers, e.g., optical digitizers, may also be used. Tablet PC 201 interprets gestures 
made using stylus 204 in order to manipulate data, enter text, create drawings, and/or 
execute conventional computer application tasks such as spreadsheets, word processing 
programs, and the like. 

[58] The stylus 204 may be equipped with one or more buttons or other features to augment its 
selection capabilities. In one embodiment, the stylus 204 could be implemented as a 
"pencil" or "pen", in which one end constitutes a writing portion and the other end 
constitutes an "eraser" end, and which, when moved across the display, indicates portions 
of the display are to be erased. Other types of input devices, such as a mouse, trackball, 
or the like could be used. Additionally, a user's own finger could be the stylus 204 and 
used for selecting or indicating portions of the displayed image on a touch-sensitive or 
proximity-sensitive display. Consequently, the term "user input device", as used herein, 
is intended to have a broad definition and encompasses many variations on well-known 
input devices such as stylus 204. Region 205 shows a feedback region or contact region 
permitting the user to determine where the stylus 204 as contacted the display surface 
202. 

[59] In various embodiments, the system provides an ink platform as a set of COM 
(component object model) services that an application can use to capture, manipulate, and 
store ink. One service enables an application to read and write ink using the disclosed 
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representations of ink. The ink platform may also include a mark-up language including a 
language like the extensible markup language (XML). Further, the system may use 
DCOM as another implementation. 

An Example of Strokes of Ink 

[60] An exemplary ink object is shown in Figure 16. The ink object starts at point 1601 where 
a pen down action occurred. The pen down action may be stylus 204 contacting the 
display surface 202, the click of a mouse button, the operation of a button on a trackball 
or joystick, or the like. The user controls an input (such as stylus 204) device and the 
resulting stroke continues through points 1602-1616. At point 1616, a pen up action 
occurred. The pen up action may be the lifting of the stylus 204 off the display surface 
204, releasing or another operation of a mouse button, or the operation of the button (or 
other buttons) on the trackball or joystick or the like. Here, a pen up action and a pen 
down action are known in the pen digitizing art. 

[61] From points 1601 through 1608, the width of the stroke has a first value. At point 1608, 
the width of the stroke changes to a second value. This may have been because the user 
increased the pressure between the stylus 204 tip and the display surface 204, because the 
angle between the stylus 204 and the tablet changed, because the stylus 204 was rotated 
and projected a different cross section of the stylus 204' s nib, or the like. The stroke then 
continues through point 1616 with the second stroke width. In an alternate embodiment, a 
user started the stroke with a first line width and selected a different line width at point 
1608 to complete the stroke. In a further embodiment, two strokes may form the ink 
object as shown in Figure 16. For example, a first stroke may include points 1601-1608 
and a second stroke may include points 1608-1616. 

[62] Next, the ink object is stored. The ink object stroke may be stored as a single stroke with 
varying line widths. Alternatively, the ink object may be stored as two or more strokes in 
which each stroke has its own set of properties. Third, the ink object may be stored as 
short strokes between points. 

[63] Figure 3 shows an exemplary method for storing the various strokes of Figure 16. First, 
in step 301, stroke information is received. This information may come from a display 
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surface 202 or any other source that is capable of generating strokes. Next, in step 302, 
the system 201 parses the stroke information 302. The system may parse a number of 
items including, for example, pressure information, stylus 204 tilt information, position 
information and the like. The parsed information is temporarily stored in step 303, 
properties added in step 304 (for example, global and/or local properties) and stored in 
step 305 in a data structure. 

[64] The data structure may contain an identifier 306 that indicates that the following structure 
is an ink object, size of the remaining sets of data 307 (or count of the properties, strokes, 
sub-objects and the like present in the data structure), a list of global ink properties 308 
that apply as the default properties for contained strokes, and/or a list of strokes 309. 
Additional information is described with respect to Figure 4. 

[65] Figure 4 shows a basic exemplary structure in which an identifier is followed by data 
relating to the identifier. Here, the identifier is generally referred to as a tag (or TAG or 
Tag). A *Tagged" structure, as shown in Figure 4, begins with an identifying "tag" 
followed by a "size field" followed by data. The "tag" identifies the contents of the data 
while the "size field" identifies, for example, the size of the data in bytes (or bits and the 
like). The tag may be either a predefined tag or an application-specific custom tag. In 
altemate embodiments, the size of the tagged field may appear prior to the tag itself. 
Benefits of this arrangement are described in relation to Figure 18. 

[66] The structure as shown in Figure 4 may also include a count of the number of objects, 
tags, properties, strokes, and the like contained within it. In this regard, the "count" 
identifier may be used in place of the "size" identifier. If one specifies the size of the rest 
of the data of the tag, a system may then quickly skip over the rest of the data of the tag if 
desired. On the other hand, if the count of the number of objects (or properties or the like) 
was specified, the physical size of the count would likely be smaller than the physical 
size of the remaining data. In this regard, the ink object would be smaller if a count of 
remaining objects (or the like) was used rather than a size of the remaining data. 
However, to skip over the remaining part of an ink object or property, one may need to 
enumerate all of the sub-objects (or sub-tags or sub-properties) contained within the 
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object or tag. To enumerate these sub-parts, a system may need to perform additional 
calculations to obtain the number of sub-parts. Further, a system may need to perform 
additional steps on the skipping operation (for example, advancing past the present object 
or tag or property) by counting the sub-parts in a count-based data structure, rather than 
advancing to a new position as used in a size-based system. 

[67] Referring to Figure 4, TagO 401 is an identifier that denotes that data relating to a first tag 
will follow. The size of the data associated with TagO appears in the next portion 402a. 
Alternatively, the count of the remaining items in TagO may be specified in portion 402b. 
The data itself for TagO appears in portion 403. Similarly, Tagl 404 indicates that data 
following the Tagl 404 will relate to Tagl 404. Here, portion 405a indicates the size of 
the data of Tagl 406. Alternatively, portion 405b indicates the number of objects (or sub- 
tags or sub-properties or the like) in the data of Tagl 406. Next, the data for Tagl 406 
follows. One benefit of placing the size after the tag itself is that applications that do not 
recognize the tag may know, by reading the next portion of information (the size block 
402 or 405), the length of data needed to skip over to arrive at the next tag or end of the 
ink data structure. If a further embodiment, both portions 402a and 402b (as well as 405a 
and 405b) may appear in the same data structure. 

[68J As to the specification of size of following data or the count of items in following data, it 
is appreciated that one may use either of the two ways of specifying information. For 
simplicity, the following description includes the use of size information. In some 
instances, count information is also shovm. However, where only size information is 
shown, count information may readily be specified in place of or in addition to the size 
information. 

[69] As applied to ink, the tagged data structure of Figure 4 may be enhanced in a number of 
ways. In some embodiments, ink strokes may be defined to come in order. In other 
embodiments, global properties (or properties that may affect all subsequent ink strokes) 
are provided at a known location for the system. For example, all global properties may 
be specified at the beginning of the ink object. On the other hand, all global properties 
may be specified at the end of the ink object. One advantage of putting the global 
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properties at the beginning of the ink object is that a system would already know how to 
handle a stroke that referenced global properties once it encounters the stroke (as it would 
have already encountered the global properties section of the ink object). In yet more 
embodiments, custom properties may be defined through various tables. Figures 6A-6C 
relate to the use of GinD tables and their use in ink objects. The use at least one of these 
or other enhancements permit properties to be used throughout the ink object and permit 
a more efficient storage of ink information. 

[70] Figure 5 shows another example of a data structure 501. Figure 5 shows a ink object 
identifier 504 at the beginning of the data structure. The ink object identifier identifies the 
following data structure as an ink object. The ink object identifier may also include 
version information that relates to the version of the software used to write the data 
structure storing the ink object. Following the ink object identifier 504 is a number 505a 
that indicates the size, in bytes, of all the global ink properties 502 and all the strokes 
503. Alternatively, the object count 505b may be specified in place of size 505a. Further, 
one may use both the size 505a and object counts 505b. The properties are identified with 
a tag that identifies the global properties 506. Next is a list of some or all the properties, 
both custom 507 and predefined 508, which apply to the ink as a whole. At the end is a 
list of strokes 503 (with an identifier or tag 509) identifying the Hst of strokes 510-511. 

[71] The "tag" describing the "data" indicates whether the "data" contains tagged structures or 
even how many tagged structures. For example, custom properties are considered opaque 
to the present system since they are application-defined and thus only contain data. ^ 
However, a stroke may have one or more stroke properties, which are also represented as 
tagged structures. 

[72] In some embodiments, the system attempts to minimize the use of redundant information. 
In one example, the system takes resources that may apply to multiple strokes, such as a 
drawing attribute, and only storing it once in the ink stream using a similar table strategy 
to that described in Figure 5. For example, a drawing attribute, which describes how a 
stroke is to be rendered, may be stored in a table. This table is then stored as an ink 
property that relates to all ink within its object. This means that, unless otherwise defined, 
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the previously defined set of drawing attributes applies to each stroke. Further, if multiple 
properties are stored in a table (for example, properties specifying different types of pen 
tips (round, square, rectangle, etc.)) a reference to a desired property is stored with each 
stroke instead of the whole drawing attribute. A similar approach may be used for other 
shared resources that may apply to multiple strokes. 

[73] The space saving use of references may be used with GXUDs. In the present system, 
GUIDs may be used to represent the various tags. Using GUIDs assures (for all practical 
purposes) that the tags will always be unique for all vendors. In one embodiment, GUIDs 
are used directly. Where the GUID is large, such as 16 bytes in size, it is impractical to 
use the GUBD as the tag since it would make the ink objects extremely large. Instead, 
some embodiments use an index value as the tag that actually maps into a GUID lookup 
table. 

[74J Figure 6A shows an example of a GUID look-up table. Here, an ink object identifier 701 
and size of the stream 702 are set forth. Next, in portion 703 is a GUID table with GUID 
table tag 710 and the size of the GUID table data 711. GUDDs 714 and 715 are listed. In 
Figure 6A, the identifiers used to reference the GUIDs for TagO and Tagl are not listed. 
Rather, they are implicit based on predetermined sequence that the first GUID is assigned 
a first identifier, the second GUID is assigned a second identifier and the like. For 
example, in Figure 6A, the identifier "100" may be associated with the first GUID 714 
and the identifier "101" may be associated with the second GUID 715. So, when in a 
property table a reference to GUID 101 is made, the system knows that GUID 715 is 
intended. 

[75] The implicit ordering method of Figure 6A may be used for all GUIDs. On the other 
hand, the implicit ordering method of Figure 6A may be used for all non-predefined 
GUIDs. So, a number of GUDDs may be predefined with identifiers in the system with all 
custom GUIDs (or GUIDs above the predefined number) listed separately as shown in 
Figure 6A, The predefined identifiers may be those that are often used for GUIDs relating 
to color, pressure, etc. In some applications, it may be possible that there never is a GUID 
table in the stream unless the application actually needs custom properties. 



16 



Banner &Witcoff, Ltd 
03797.00132/171946.1 

[76] The remaining fields 704 - 709 are siniilar to that described with respect to Figure 5. 
Here, for instance, part of the data of field 706 includes a reference to the GUID table 
703 as "100". Because the system knows "100" relates to the GUID 714, the data in field 
706 is understood. Similarly, the data for Tagl in field 709 (preceded by the Tagl 707 
identifier and the size of Tagl data in field 708) may make reference to QUID table 703 
with a QUID reference value of "101" (or "100" depending on the QUID to be 
referenced). Because the system knows what GUIDs relate to what identifiers, the system 
may process the data of fields 706 and 709 properly, 

[77] Figure 6B shows an alternative representation of the data structure of Figure 6A. In 
Figure 6B, the data structure includes explicit identifiers 712 and 713 for GUIDs 714 and 
715, respectively. Using this arrangement, all identifiers for the GUIDs may be explicitly 
specified. So, if the system has identifiers "100" and "101" predefined for something 
else, the data structure permits explicit storage of the identifiers (e.g., "507" and "902" to 
be associated with the GUIDs 714 and 715. 

[78] Figure 6C shows yet another data structure arrangement in which the GUIDs 714 and 715 
are specified and the GUID identifiers 712 and 713 follow the GUIDs, 

[79] The table sets forth relations between identifiers for TagO 712 and Tagl 713 and the 
GUID value for TagO 714 and the GUID value for Tagl 715. 

[80] The GUID lookup table 703 may be stored in the serialized stream as an ink property. 
This means that each GUID does not need to be repeated in each property of each stroke. 
Rather, actual tags in the serialized stream identifying a GUID can be a small integer 
index. Very little space is wasted on the GUID since it is stored only once in the stream, 

[81] Finally, while GUIDs are used to describe the system, it is appreciated that other 
identification systems may be used to identify the tags as described herein. For example, 
the GUIDs may be replaced by smaller identifiers, which then reference a GUID table for 
all ink objects. While one would lose some degree of portability of ink, as the ink objects 
are no longer completely contained, the size of the ink objects may be decreased. 
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Figure 7 shows another example of a tagged data structure used to store ink. In Figure 7, 
the tags are nested. In other words, some tags contain other tags. One advantage of 
nesting tags includes suggesting a relationship by the location and nesting of the tags. 
Here, TagO 601 is represented with the size 602 of TagO and the data 603 for TagO. Tagl 
604 includes other tags. Field 605 represents the size of the rest of Tagl. TaglO is nested 
within Tagl. Field 607 identifies TaglO and its size is listed in field 608. The data of 
TaglO are shown in field 610. Here, TaglO contains another tag, namely Tag 100, which 
is identified by field 609, with its size provided in field 611 and its data in field 612. 
Finally, Tagl contains Tagil, which is identified by field 613 and has the size of its data 
in field 614 and the data for Tagil in field 615. Possible locations for the data of tags 
Tagl and TaglO are as shown in Figure 7. Thus, multiple levels of properties associated 
with ink may be stored in an efficient manner. 

In another example, as shown in Figure 8, multiple strokes may be stored in a stream. 
Figure 8 contains only X and Y data in each stroke. The example of Figure 8 shows ink 
object identifier 801, size of stream 802, stroke tags 803 and 808, size of stroke blocks 
804 and 809, the number of strokes in each stroke 805 and 810, the X coordinates (806 
and 811) and the Y coordinates (807 and 812). 

Both strokes in the above example are of the simplest form since they contain only the 
count of points 805 and 810, and the X and Y data for each stroke. The X and Y data may 
be compressed or uncompressed. Compressing the data saves space, but leaving the data 
in an uncompressed form generally increases the speed of the storing and rendering 
processes. 

A stroke with no properties may be defined as the default stroke. That is, there are no 
additional packet properties (such as pressure) associated with each point, nor any other 
per-stroke information in any of die strokes. One implementation would render this ink 
with the default drawing attributes (black pen, width of one, ball tip, etc.) 

The count of bytes used to store the compressed X data or compressed Y data may be 
stored separately from the disclosed data structure. This is advantageous because given 
the count in the number of points, the decompression algorithm can just read enough 
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bytes from the stream until it decompresses the number of coordinates, thereby increasing 
the decompression speed. The first byte of the X and Y data indicates the compression 
algorithm used, if any. 

Common Stream Items 

[87] One or more tags may be used to assist in the reduction of redundant data in the in 

stream. While various tag names are given, it is appreciated that these tag names are for 
example purposes only. 

[88] First, one may use an ink object identifier (see ink object identifier 801 with a version 
number in Figure 8) to indicate to alert an application whether it can adequately process 
the following stream of information. If an application can only process a lower version 
number, once it reads the higher version number, it will stop processing the stream. When 
rendering, this may cause the application to retrieve an empty ink object (an ink object 
with no content) and return an error to the apphcation. 

[89] Second, following the ink object identifier may be the size field (see size 802 in Figure 8) 
for the stream. The value of this field indicates the size of the stream, such as the total 
number of bytes following the size field. 

[90] A conforming implementation should, in one example, be prepared to accept a size value 
equal to the largest file size on Windows NT. Presently, this is a 64-bit number but may 
be larger in the future. In some implementations, such as placing the present system on a 
limited resource device (e.g., a Windows CE device or a Palm OS device), it may be 
difficult for an application to process a file of this size. The implementation in that case 
may generate an appropriate error to the application. 

[91] An additional benefit of having the size of the stream actually stored in the stream is that 
if an application passes a random buffer to the system, then the first two values in the file 
will usually be zero. This means that the present system will generally immediately stop 
processing an invalid stream. 
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[92] Third, global ink properties may be defined to minimize or eliminate redundant 
information. In simple streams, there may be no global ink properties. However, if there 
are global ink properties, then it is preferable that they appear before any strokes so that 
they may apply to all strokes. There are generally eight global ink properties that may be 
used. Of course, more or less global properties may be defined and used as needed. 

[93] The global properties may include one or more of the following: 

1) a GUK) table; 

2) an ink space rectangle; 

3) a drawing attribute table; 

4) a stroke descriptor table; 

5) a transform table; 

6) a metric table; 

7) a compression header; 

8) and custom ink properties. 

[94] It is appreciated that the above times may also be included as local properties if nested 
within other tags. 

1) GUID Table 

[95] The GUID table, if it appears in the stream, preferably appears immediately after the size 
field. It is appreciated that the GUK) table may appear later or earlier. However, the 
global application of the GUIDs might a) only apply to items subsequent to the GUID 
table and/or b) slow down the processing of the ink as previous processed tags may need 
to be reevaluated after encountering a GUID table. 
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[96] The GUID table may be identified with the tag 710. In one example, one may use the tag 
TAG„GUID_„TABLE. The size of the table is a multiple of the GUID size, where each 
entry is a custom GUID as used in the stream. 

2) Ink Space Rectangle 

[97] An ink space rectangle defines a virtual coordinate space (or virtual ink space) for 
receiving ink. One tag that may be used includes TAG„INK_SPACE_RECT, which 
identifies the ink space rectangle when present in the stream. It does not have a size field 
since it has a fixed number of elements of, for example, four signed numbers. These four 
numbers represent the left, top, right, and bottom of the ink space. The ink space 
rectangle defines the virtual coordinate space for the ink. An application may use this 
rectangle to determine what area of the ink to either display or print. The ink space 
rectangle essentially defines a virtual sheet of paper that the ink is drawn on. This does 
not mean that ink may not appear outside this area. However, the application may use this 
rectangle when deciding how to display the ink according to a given view or on the 
printer. 

3) Drawing Attributes Table 

[98] The drawing attributes table may list all drawing attribute sets of properties (also referred 
to as blocks) in the stream. Each drawing attribute block defines information used while 
rendering the ink. These blocks may apply to one or more strokes and may be placed in 
the drawing attributes table so that they are not repeated in each stroke. 

[99] An exemplary tag includes TAG„DRAW_ATTRS_TABLE as an identifier. This 
identifier may be followed by the size of the table. The size of the table may be equal to 
the sum of the sizes of all drawing attribute blocks. 

[100] A drawing attributes table containing only one drawing attributes block is a special case. 
In one embodiment, the tag and size for the table is omitted and the entire table is 
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replaced by a single drawing attributes block. This reduces information in the drawing 
attributes table when not needed to define only a single entry. 

[101] Figure 9 shows a sample drawing attribute block. Each drawing attribute block starts with 
an identifier. Here, the identifier 901 may be represented as 
TAG_DRAW_ATTRS3LOCK. The identifier 901 may be followed by the size 902 of 
the block. The block may contain a tagged list of drawing attributes. Each entry in the list 
of drawing attributes may be a pre-defined or custom tag. 

[102] Figure 9 shows a sample pen width tag 903 with its accompanying value 904, a color 
reference tag 905 and its accompanying value 906, a custom drawing attribute tag 907 
with its size 908, and custom drawing attribute data 909. 

[103] To save space, the tag for the block may be is omitted when the block appears in a 
drawing attributes table. This is because the table can only contain blocks and the tag is 
redundant. The next block may start immediately after the end of the previous block. 

[104] Furthermore, all predefined drawing attributes, such as the pen width or color reference, 
have no size indicator where the data types (e.g., positive integers) are known ahead of 
time. However, a custom property preferably has a size field since the size of the data is 
not known ahead of time. 

[105] In one embodiment, default and non-default values are stored. Alternatively, another 
possible space saving optimization is that drawing attributes that are set to the default 
value are not stored in the attributes block. In the above example, pen tip value is not 
stored since it is assumed to be the default value. Here, the pen tip may be considered to 
be a ballpoint pen as represented by the tag PEN_TIP_BALL. 

4) Stroke Descriptor Table 

[106] The stroke descriptor table lists stroke descriptor blocks in the stream. These blocks may 
apply to one or more strokes and may be placed in this table so that they are not repeated 
in each stroke. 
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[107] An identifier is used to identify the stroke descriptor table. The tag may include 
TAG_STROKE_DESC_TABLE, The size of the table may follow the size of the table. 
The size of the table may be equal to the sum of the sizes of all stroke descriptor blocks* 

[1081 The system may have multiple levels of blocks even if there is only one block for a table. 
On the other hand, a stroke descriptor table containing only one, single level, stroke 
descriptor block may be considered a special case. In this regard, the tag and size for the 
table is omitted and a single stroke descriptor block may be used to replace the entire 
table. In short, a table may contain multiple blocks except when there would only be one 
block to table. Here, the ink object may only include the stroke descriptor block. 

[109] A stroke may contain arrays of data where each array element corresponds to a property 
of a point. Figure 8 shows only X and Y data since there was no stroke descriptor block 
in the stream. However, an application may attempt to store other properties, such as 
pressure. An application may simply create a custom stroke property (described later) to 
store pressure. Unfortunately, no other application would know how to interpret this data 
and the tag and size of the custom property would be stored in each stroke, thus wasting 
space in the data structure. 

[110] In this event, the stroke descriptor block may be used to solve this problem by defining 
the data types and their order in the stroke. The system may then use an index to associate 
a stroke with a particular stroke descriptor block. 

[Ill] Typically, all strokes in an ink stream will use the same stroke descriptor block. 
However, a stroke descriptor table that contains only one block is rare. However, the 
system allows different strokes to contain different sets of data by placing the blocks in a 
table. 

[112] Referring to Figure 10, each stroke descriptor block may include an identifier 1001. Here, 
the following tag may be used: TAG_STROKE_DESC„BLOCK. The size 1002 
generally follows the identifier 1001 . 

[113] It is assumed herein that, by default, all strokes will contain X and Y coordinate data 
arrays and that these will be the first data arrays stored in a stroke. This provides the 
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ability for the stroke to default to looking for X and Y coordinate data (which is more 
likely than having strokes lack X and Y data). If for some reason the application does not 
wish to store X and Y coordinates, then it may create a stroke descriptor block containing 
placeholders to occupy the designated fields for the X and Y coordinate arrays. Here, the 
placeholders are No X Data Tag 1003 and No Y Data Tag 1004. One may also use 
TAG_NO„X and TAG_NO_Y values as the two entries. Otherwise the first two arrays in 
a stroke may be assumed to correspond to the X and Y coordinates. 

[114] After the optional TAG_NO_X, TAG_NO_Y placeholders is an array of packet property 
tags 1005. This array ends when a button identifier or a stroke property identifier is 
encountered. These may be represented as well as TAG„BUTTONS, 
TAG_STROKE„PROPERTY_LIST or the end of scope is encountered. Each packet 
property tag defines another array in the stroke. 

[115] Following the packet property array 1005 may be an optional "buttons" section that 
describes the button bit-fields that make up the elements of the button array in the stroke. 
Not all input devices report button states, so this section is optional. If present, the 
"buttons" section may start with a first button description tag 1006 (e.g., a tag may be 
TAG_BUTTONS) followed by the count of buttons 1007 (as represented by an identifier, 
for instance, "cButtons") and an array of button GUID tags 1008, one tag for each button. 
Note that these tags may be encoded (described later) and the size of the button array may 
not be an exact multiple of the number of buttons (cButtons). 

[116] If the end of the stroke descriptor block scope has not been reached, then what may 
follow is a stroke property list. The stroke property list may include identifier 1009 (one 
may also use TAG_STROKE_PROPERTY_LIST). Following the identifier 1009 is a list 
of stroke property tags in the array of tags for stroke properties 1010. These tags do not 
describe arrays of values. Instead they are an optimization that allows a tag that appears 
repeatedly in strokes to be omitted; only the size and the data need to be specified for the 
listed property. A stroke may still have additional stroke properties that are not listed in 
its stroke descriptor block under TAG_STROKE_PROPERTY_LIST. However, these 
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additional properties may have to be listed in the stroke after all the properties listed in 
the block and they should be tagged explicitly within the stroke. 

5) Transform Table 

[117] As described above, ink may be captured in a variety of ways. The ink may be made 
larger or smaller, rotated, translated, warped, and the like. These transformations present 
a variety of problems. The first problem is that the assumptions made when designing the 
compression schemes may no longer be valid. This may result in data that would have 
been compressed well in its native form to bloat and compress very poorly. The second 
problem is that it is very easy to perform operations on the ink that would cause precision 
to be lost. This may cause the recognition process to fail and prevent the ink from being 
converted to text or might cause the ink to render incorrectly. 

[118] To solve this problem, some embodiments of the present system may allow the ink to be 
stored in its original native format. For example, whenever ink is transformed, only a 
transform matrix is affected rather than each point in the ink. This preserves the precision 
of the original ink and also allows compression to function optimally. 

[119] The transform table may list all transform blocks in the stream. Each transform block 
may define a unique transform that is applied to the ink points before they are rendered or 
used. These blocks may apply to one or more strokes and are placed in this table so that 
they are not repeated in each stroke. 

[120] One may use the identifier of TAG_TRANSFORM_TABLE to identify the transform 
table. The size of the table may follow. The size of the table may be is equal to the sum of 
the sizes of all transform blocks. 

[121] A transform table containing only one transform block is a special case. The tag and size 
for the table may be omitted and the entire table may be replaced by a single transform 
block. 
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[122] In the simplest case and the cases where scaling and transforms have been applied outside 
of a native capture environment (for example when currently using ink in MS Word), no 
transform tables may be created 

6) Metric Table 

[123] The metric table lists metric blocks in the stream. These blocks may apply to one or more 
strokes and may be placed in this table so that they are not repeated in each stroke. 

[124] The tag, TAG_METRIC_TABLE, may be used to identify the metric table and may be 
followed by the size of the table. The size of the table is equal to the sum of the sizes of 
all metric blocks. 

[125] A metric table containing only one metric block is a special case. The tag and size for the 
table may be omitted and a single metric block may replace the entire table. 

[126] One previous example showed how an ink stream may contain only strokes made up of X 
and Y elements. The stroke descriptor then enabled the addition of more properties to a 
stroke. The metric block further refines the definition of the properties defined in a 
stroke. The individual array elements in a stroke, such as the X or Y values, may be 
represented in logical device coordinates. However there may will be times where an 
application will need to know the relationship between these logical values and some real 
physical characteristics. For example, it may not be known implicitly whether pressure is 
in pounds, Pascal's or kilograms, or an angle with a value of 10 is in degrees or radians. 
Without further information an application may assume these values are in the standard 
normalized fomi, as defined by the ink object system. 

[127] One purpose of the metric block is to defining some relationship between the logical units 
stored in the stroke and physical characteristics. The most common ones being: minimum 
value, maximum value, precision, and/or units. 

[128] Typically all strokes in an ink stream will use the same metrics block. An ink stream may 
have several stroke descriptors, yet still only have one metric block. However, the present 
system may allow different strokes to refer to different metric blocks in the metric table. 
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[129] Each metric block may start with an identifier 1101 (for example, 
TAG_METRIC„BLOCK) and is followed by the size 1102 of the block. The 
TAG_METRIC_BLOCK may be omitted in the metrics table. Next, each entry may 
follow as entry[0] 1103 - entry[number of metric blocks-1] 1105. The various entries may 
describe minimum and/or maximum values for values, the degree of precision and 
associated units, as well as other properties, 

[130] The metric block does not necessarily need to define all the packet property tags that are 
used in the stroke descriptor, since the application may not care about the metrics 
associated with all the properties or the device may not provide metrics for all the 
properties. In order to permit the metric block to be easily read in conjunction with the 
stroke descriptor, the entries in the metric block should be in the same order as found in 
the stroke descriptor. The metric block differs from the stroke descriptor since it may 
contain data for X and Y values (for example, identified with TAG_X and TAG_Y). This 
is because X and Y values may have metrics that need to be stored. 

7) Compression Header 

[131] A block may be specified that addresses the amount of compression to apply to values. 
The header may contain data used by decompression algorithms to correctly decompress 
packet property arrays including X, Y coordinates, pressure and the like. Inclusion of 
compression data may occur when the system determines that predefined compression 
algorithms and their associated levels of compression may not be enough to store 
information to achieve the maximal compression of ink. 

[132] The type of compression used may include various compression algorithms as are known 
in the art including Huffman compression algorithms and others. The compression header 
may include an identifier, the number of bytes needed to store the header, a bit 
assignment table including bit assignment blocks, and/or (optionally) an index map table. 
If the bit assignment table is zero, null, or nonexistent, then the index map table may not 
be included to save space, 

8) Custom Ink Properties 
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[133] Custom ink properties may be defined for various applications. The identifier for a 
custom ink property may list a custom GUID specified in the GUID table. Next, the size 
of the compressed data for the property may be is provided* Finally, the data associated 
with the property may be provided. Additional data may follow in the stream relating to 
the next property. 

[134] Like global properties, local properties may also be specified. Local properties may be 
positioned after the global ink properties. Alternatively, as described above, global ink 
properties may be placed lasted in an ink object. Local properties do not apply to the 
entire ink stream. Strokes are an example of a local property. Other local properties, such 
as a drawing attribute index, may apply to all the strokes that appear after that point in the 
stream until the next time that local property appears in the stream again. Just like global 
ink properties, local properties are also optional. A valid ink stream could potentially 
contain no local properties at all. However, such a stream would not Hkely be useful. 

[135] The local properties may include, for example: 

1) a drawing attribute index; 

2) a stroke descriptor index; 

3) a transform index; 

4) a metrics index; and 

5) strokes 

1) Drawing Attribute Index 

[136] The drawing attribute index (which may be identified by the tag TAG_DIDX) assigns a 
drawing attribute block to a stroke. The drawing attribute index may be followed by an 
index value that specifies the entry in the drawing attributes table. All strokes in the 
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Stream from that point on may use the specified drawing attribute block until the next 
drawing attribute index is encountered in the stream. 

[137] In an alternative embodiment, if there is no drawing attribute index in the stream 
somewhere before the stroke, it may be assumed that this stroke should use the 0th (i.e., 
first) drawing attribute block in the drawing attributes table. And if there is no drawing 
attributes table in the stream, then all strokes may be drawn using the default set of 
drawing attributes. 

2) Stroke Descriptor Index 

[138] A stroke descriptor index (which may be identified using a tag as TAG^SIDX) assigns a 
stroke descriptor block to a stroke. A stroke descriptor index may be followed by an 
index value that specifies the entry in the stroke descriptor table. All strokes in the stream 
from that point on may use the specified stroke descriptor block until the next stroke 
descriptor index is encountered in the stream. 

[139] In an altemate embodiment, if there is no stroke descriptor index in the stream 
somewhere before the stroke, it may be assumed that this stroke should use the 0th Stroke 
descriptor block in the stroke descriptor table. And if there is no stroke descriptor table in 
the stream, then all strokes may be assumed to contain X and Y coordinates only. 

3) Transform Index 

[140] The transform index (which may be specified with the identifier TAG_TIDX) assigns a 
transform block to a stroke. A transform index may be followed by an index value that 
specifies the entry in the transform table. All strokes in the stream from that point on may 
use the specified transform block until the next transform index is encountered in the 
stream. 

[141] In an altemate embodiment, if there is no transform index in the stream somewhere 
before the stroke, it may be assumed that this stroke should use the 0th transform block in 
the transform table. And if there is no transform table in the stream, then no transforms 
should be applied to any stroke. 
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4) Metric Index 

[142] The metric index (which may be identified using the tag TAG_MIDX) assigns a metric 
block to a stroke. The metric index may be followed by an index value that specifies the 
entry in the metric table. All strokes in the stream from that point onward may use the 
specified metric block until the next metric index is encountered in the stream. 

[143] In an alternate embodiment, if there is no metric index in the stream somewhere before 
the stroke, this stroke may use the 0th (i.e., first) metric block in the metrics table. If there 
is only one metric block then the table may be omitted and the 0th metric block is the 
only metric block in the stream. 

5) Strokes 

[144] As described earlier with relation to Figure 8, strokes are one of most important 
properties in ink. Strokes contain the packet data that make up the individual points in a 
stroke and potentially other per-stroke properties as well. 

[145] As shown in Figure 13, the stroke may begin with an identifier of the stroke 1301 (for 
example, TAG_STROKE) and may be followed by a size field 1302 (e.g., that is the 
count in bytes of all the data starting from (and including) the number of point in the 
stroke 1304 to the end of the stroke). The number of points 1303 may follow the size of 
the stroke 1302 and defines how many points are stored in the packet data of this stroke, 

[146] The data contained in the stroke is found in fields after the number of points 1303. The 
packet data may include of arrays of values (X coordinate information 1304, Y 
coordinate information 1305), where each array is in the same order as described in the 
stroke descriptor for the stroke. If the stroke descriptor did not specify that no X or Y data 
was included (for example, the identifiers TAG_NO„X or TAG„NO_Y), then the first 
two arrays may be the X and Y coordinate arrays. In the situation where the stroke 
contains no packet data at all, the number of points from field 1303 may be zero. 

[147] Next, the stroke may include button states. Each button state for each point may be stored 
as a bit in the array. The button state may be stored compressed or uncompressed. The 
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number of button states may be stored (and read from the stroke descriptor). In the 
compressed storage of button states, the compressed button array uses the number of 
points* ((number of button states+7)/8) bytes in the stream to store all the state 
information for each point. Making substitutions, if one uses cPoints to represent the 
number of points and cButtons to represent the number of buttons, the resulting storage 
size for the buttons = cPoints * ((cButtons+7)/8) bytes. This storage efficiency may be 
improved by the fact that to store the same information in a non-serialized format 
requires ((cButtons +31)/8) bytes for each packet. 

[148] Stroke properties follow packet data (if non-default stroke properties are to be used). 
Stroke properties are listed by an identifier. For example, the identifier 
TAG_STROKE_PROPERTY_LIST may be used. The stroke properties may appear after 
the buttons array. In some embodiments, the stroke properties may have their tags 
omitted since the tags are specified in the stroke descriptor. The system continues to 
process all parts of the stroke. If the end of the stroke scope has still not been reached, 
then more stroke properties may be specified listing the properties with their tags. 

[149] In addition to strokes having properties, individual points may have properties as welL As 
point properties are stroke properties assigned to a point, they are defined with stroke 
properties. A sample point property list may be shown in Figure 14. The point property 
list starts with an identifier 1401 (for example, one may use TAG_POINT_PROPERTY 
as the identifier), which may be followed by the size of the whole point property list as 
provided in field 1402. After the size of the whole point property list 1402, the tagged 
point properties follow. 

[150] In Figure 14, two properties are specified, namely property 1 and property 2. The tags for 
property 1 and property 2 are specified in fields 1403 and 1407, respectively. Next, the 
index of the point within the stroke may be provided (field 1404 for property 1 and field 
1408 for property 2). Next, the size data of the properties are provided (in field 1405 for 
property 1 and in field 1409 for property 2). Finally, the data for the properties are 
provided in fields 1406 and 1410 for property 1 and property 2, respectively. 
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A Complex Example 

[151] Figure 15 shows a complex example of an ink object. The complex example includes 
number of the fields and tags described above. The entries in the ink object 1501 a 
described below. 



Ink Object Identifier 


Identifies data structure as an Ink Object 


Size/Count of the Rest of tiie 
Object 


Size/count of the rest of the ink object 


Stroke Descriptor Table Tag 


Tag for stroke descriptor table 


Size of Table Having Two 
Blocks 


Size of stroke descriptor table containing 
two blocks 


Size=l (for the first block) 


Size of the 0^ block is 1 length, only X, Y 
in these strokes 


Normal Pressure Tag 


Indicates pressure is the first thing after X, 
Y 


Size=0 (for the second 
block) 


Size of the 1^' stroke descriptor block 


Metrics Block Tag 


The only metrics block in the stream 


Size/Count of the Metric 
Block 


Size/Count of Metric block 


Normal Pressure Tag 


Metrics for X and Y are not stored in this 
example. 


Size/count for Normal 
Pressure Block 


Size/count of the normal pressure block 


Min Pressure {-10} 


Minimal allowed value for pressure, e.g. - 
100 {may specify minimum in portion} 


{10} 


No value in the stream wiU be below 10 
{Value may be separately specified} 


Max Pressure {-90} 


Maximal allowed value for pressure e.g. 
+100 {may specify maximum in portion} 


{90} 


No value in the stream wiD be above 90 
{Value may be separately specified} 


Precision {-1} 


Precision of device. E.g. 100 per unit {may 
specify value} 


{1} 


Precision of device logical units are in 
increments of 1 {value may be separately 
specified} 


Units {-2} 


Units for the precision, e.g. value indicating 
tons {may specify value} 


{2} 


Numeric value indicating units. If this 
number indicated that the units where in 



32 



Banner & Witcoff, Ltd. 
03797.00132/171946.1 





kilograms then because precision is 1 then 
each increment of logical titiit^i would he 
kilo {value may be separately specified} 


Stroke Tag 


Tag for the stroke 


Size/count of the stroke 
and related strokes 


Size/count for stroke and all its children 


Number of points in stroke 


Count of points in this stroke 


Xdata 


Compressed X coordinates 


Ydata 


Comoressed Y coordinates 


Pressurc data 


Comoressed normal nressure data 


Stroke Descriptor Table 
Index Tag 


Tag for index into a stroke descriptor table 


1 


Value for the stroke de^rintor index 


Stroke Tag 


Ta2 for stroke obiect 


Size/count of the stroke 
and related stroke 
properties 


Size/count for stroke and related r>ronertieQ 


Number of points in stroke 


Count of Points in this stroke 


Xdata 


Compressed X coordinates 


Ydata 


Comnressed Y coordinates 


Stoke Tag 


Tap for stroke obiect 


Size/count of the stroke 
and related stroke 
properties 


Size/count for stroke and related nronerries 


Number of points in stroke 


Count of Points in this stroke 


Xdata 


Compressed X coordinates 


Ydata 


Compressed Y coordinates 



[152] In the example shown, at the beginning of the stream, before any strokes, there may be a 
stroke descriptor table with two stroke descriptor block entries. The 0th entry in the 
stroke descriptor table has size zero, i.e. it may be empty. This signifies that the strokes, 
which are described by this entry, contain only X and Y vectors. The 1st entry of the 
stroke descriptor table describes the strokes that in addition to X and Y arrays contain a 
pressure vector. Here, this entry does not contain descriptors for X and Y arrays. Rather, 
they are implicit. Li contrast, only pressure is described with its minimal and maximal 
values. 

[153] Figure 15 next shows the stroke descriptor table with an index table after it. The stroke 
descriptor table index value refers to all the strokes in the ink object that follow in the 
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stream, until the end of the ink stream or until a next stroke descriptor table index may be 
found in the stream. 

[154] In the example above, the stroke descriptor table index value of 1 applies to the 0th stroke 
and the stroke descriptor table index value of 0 applies to the 1st and 2nd strokes. 
Therefore, the 0th stroke contains pressure vector in addition to X and Y vectors, the 1st 
and 2nd strokes only contain X and Y vectors. 

[155] Figures 17 and 18 describe two processes for reading a stored ink object. In Figure 17, 
the system starts reading the data structure in step 1701. Next, a tag may be read in step 
1702* If the system understands the tag (step 1703), it proceeds to read the rest of the 
block following the tag (step 1707). If the system does aot understand the tag, it reads the 
next entry, which may be the size (or count) of the rest of the block in step 1704, and 
skips past the end of the block as based on the read size (or count) in step 1705. Next, the 
system determines if there are more tags to read in step 1706. If no, then reading of the 
ink object stops in 1708, otherwise the system reads the next tag in step 1702. 

[156] In Figure 18, the size (or count of items) of the tag may be read first. This may occur by 
placing the size or count information ahead of the tag identifier. The system starts reading 
the data structure in step 1801. Next, the size or count of the block may be read in step 
1802. Next, a tag may be read in step 1803. If the system understands the tag (step 1804), 
it proceeds to read the rest of the block following the tag (step 1805). K the system does 
not understand the tag, it skips past (step 1806) the end of the block as based on the read 
size (or count) in step 1802. Next, the system determines if there are more tags to read in 
step 1807. If no, then reading of the ink object stops in 1808, otherwise the system reads 
the next tag in step 1802. The size or count of items (as read in step 1802) may or may 
not include the reference to the size or count of the tag (as read in step 1803). 

[157] While not shown, a further implementation is possible. The system may eliminate the 
size component of the data structure. To skip to the next tag, the system may scan for a 
predefined tag identifier (for example, "INK_" or the like) or any other series of values. 
Also, the system may scan for a terminator of the present tag (for example, "111111 1"). 
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Encoding of Values 

[158] The serial nature of the ink object leads to efficient storage. In some embodiments, the 
values in the ink object may be encoded to further reduce the size of the stream needed to 
store the ink object. In one example, tags are indexes to GUIDs so that the GUID may be 
not repeated unnecessarily. In the above examples, X and Y data may be compressed. A 
number of encoding strategies and compression methods may be used alone or in 
combination. 

Sizes of Tags and Numbers 

[159] At the most basic level, the ink object may be composed of numbers. Even tags may be 
considered indexes, which are just small integer numbers. In fact, most of the time these 
numbers are small enough that they could be represented by a single byte if there was a 
way of determining when a byte represented a single number and when it was just part of 
a bigger number. In some embodiments, no encoding is used. Li other embodiments, it 
may be possible to take advantage of this observation by encoding numbers using a 
multi-byte encoding technique. 

[160] Multi-byte encoding makes it possible to represent small numbers in one byte, larger 
numbers in two bytes and very large numbers in however many bytes are necessary. This 
means that tags, which usually have a value less than 100, are stored as a single byte and 
sizes, which may be small or large, are stored in the most efficient manner. In effect, 
multi-byte encoding may be a compression technique. 

[161] Various types of multi-byte encoding are known. An example of multi-byte encoding is 
shown and works as follows: 

a. Numbers less than 128 are encoded in one byte. 

b. The most significant bit remains in the byte clear. 



35 



Banner & Witcoff, Ltd. 
03797.00132/171946.1 



c. Multi-byte encoding interprets the most significant bit being clear to mean this 
may be the last byte in a number. 

d. Numbers larger than 128 are broken up into 7 bit segments. 

e. The 7 bit segments are then each stored in a byte. 

f . And the most significant bit in each byte except the last may be set. 
[162] In other words, the system handles information such that: 

a. Numbers less than 2^= 128 are encoded in a single byte. 

b. Numbers less than l}^ = 16384 are encoded in two bytes. 

c. Numbers less than 2^^ = 2097152 are encoded in three bytes. 

d. Etc. 

[163] In general, bytes are processed until a byte with the most significant bit clear may be 
encountered. For example, the first number encountered may be the ink object identifier 
number. For version LO this value may be "0" and can be encoded in a single byte. The 
next number may be the size of the stream following the size value, and for small ink 
objects as in the first example this will also be encoded in a single byte. However, if the 
stream may be long this value can grow as large as necessary. For example, a multi-byte 
encoded number of 10 bytes can represent a 64-bit number. 

[164] This same process may be applied to "tags" and other values in the stream. In general 
since "tags" are small integer indexes, they too will be one byte encoded. 

Multi-byte Encoding of Signed Numbers 

[165] Multi-byte encoding as described above works well for positive integers. However, in 
some cases it may be necessary to store signed numbers. For example, the coordinates of 
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a point may be positive or negative depending on where the application situates the 
origin. 

[166] To multi-byte encode a signed number, the absolute value of the signed number may be 
determined, the absolute value then may be shifted left by 1 bit, and the sign of the 
original number may be stored in the list significant bit. 

[167] Using the technique set forth above, the signed numbers with absolute values are handled 
as follows: 

a. Nmnbers less than 2^ = 64 are encoded in one byte, 

b. Numbers less than 2^^ = 8 1 92 are encoded in 2 bytes 

c. etc. 



Tags for Predefined and Custom GUIDs 

[168] Representing tags with indices into a GUID table may be another technique used to make 
the ink object efficient. As was akeady mentioned above, some of these GUIDS are 
predefined while others are custom to the application. The first 100 entries in the table are 
reserved and are not stored in the serialized stream. 

[169] In one example, only custom GUIDs, which may be represented by indices greater than 
100, may be stored in the serialized format. The GUID table identifier (e.g., 
TAG_GUID_TABLE) in the stream identifies the custom GUID look up table. In such an 
example, the custom GUID table may be part of the ink stream only if the ink object 
contains at least one custom GUID. 

[170] As mentioned above, the system may use multi-byte encoding for the various tags listed 
above. As long as an application does not use more than a certain number (e.g., 27) of 
custom properties in any given ink object, then the all tags may be encoded using only a 
single byte. 
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A Summarization of the Storage of Ink 

[171] Ink may be stored in an ink object with the ink object providing coordinate data and/or 
other properties associated with strokes. Compression may be used to increase the 
efficiency at which the ink may be stored. 

[172] Although the invention has been defined using the appended claims, these claims are 
exemplary in that the invention may be intended to include the elements and steps 
described herein in any combination or sub combination. Accordingly, there are any 
number of alternative combinations for defining the invention, which incorporate one or 
more elements from the specification, including the description, claims, and drawings, in 
various combinations or sub combinations. It will be apparent to those skilled in the 
relevant technology, in light of the present specification, that alternate combinations of 
aspects of the invention, either alone or in combination with one or more elements or 
steps defined herein, may be utilized as modifications or alterations of the invention or as 
part of the invention. It may be intended that the written description of the invention 
contained herein covers all such modifications and alterations. For instance, in various 
embodiments, a certain order to the data has been shown. However, any reordering of the 
data is encompassed by the present invention. Also, where certain units of properties such 
as size (e.g., in bytes or bits) are used, any other units are also envisioned. 
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